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Abstract. We use the Bellman function method to give an elementary proof of a sharp 
weighted estimate for the Haar shifts, which is linear in the A2 norm of the weight and 
in the complexity of the shift. Together with the representation of a general Calderon- 
Zygmund operator as a weighted average (over all dyadic lattices) of Haar shifts, cf. [lUI 
111) , it gives a significantly simpler proof of the so-called the A2 conjecture. 

The main estimate (Lemma 6.1 1 is a very general fact about concave functions, which 
can be very useful in other problems of martingale Harmonic Analysis. Concave functions 
of such type appear as the Bellman functions for bounds on the bilinear form of martingale 
multipliers, thus the main estimate allows for the transference of the results for simplest 
possible martingale multipliers to more general martingale transforms. 

Note that (although this is not important for the A2 conjecture for general Calderon- 
Zygmund operators) this elementary proof gives the best known (linear) growth in the 
complexity of the shift. 
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Notation 

^ a dyadic lattice in M or M*^; 

chid / the collection of children of the interval (cube) /; 

chldfe / the collection of children of the order k of the interval (cube) /; the collection 

chldo(/) consist of the interval I; 
\E\ the d-dimensional Lebesgue measure of a set £^ C M*^; 

(/)/' /t / average of / over /, (/)^ = |/|-^ j{x)dx\ 

averaging operator, E^f := [f^'i-j] 
Aj Martingale difference operator, := —E^ + X]jechid(/) -^j' 

L'^{w) the weighted space, ll/ll^j^^) = Jr" \fix)\'^w{x)dx. 



1. Introduction 

The famous Hunt-Muckenhoupt-Wheeden theorem states that a Calderon-Zygmund 
operator T is bounded in the weighted space L'^{w) = L'^(W^,w) if and only it the weight 
w satisfies the so-called Muckenhoupt condition 

{A2) sup(^\Q\~^ J wdx^(^Q\~^l w'^dx^ -.[w\j^^< 00, 

where the supremum is taken over all cubes Q in W^. The quantity [w]^^^ is called the 
Muckenhoupt (or A2) norm of the weight w (although it is clearly not a norm). 

It has been an old problem to describe how the norm of a Calderon-Zygmund operator 
in the weighted space L'^{w) depends on the Muckenhoupt norm [w]^^^ of w. A conjecture 
was that for a fixed Calderon-Zygmund operator T its norm in L'^{w) is bounded by 
C ■ [w]j^^, where the constant C depends on the operator T (but not on the weight w). 
Simple counterexamples demonstrate that for the classical operators like Hilbert Transform 
or Riesz Transform, a better estimate than C ■ [w]a2 generally not possible. 

For operators that are not "too singular" better than linear estimates are possible. For 
example, for the convolution with the Poisson kernel Pr its norm in L'^{w) is estimated by 
CHV2^ i.e. 

l|p.*/lL.(^)<c[<ll/lL.(^). 

This can be easily explained: the norm of the averaging operator E^f = {f)j'i-j is 
exactly {w)y'^ {w~^)y^ , and the convolution with the Poisson kernel can be estimated by 
an average of the averaging operators. Also, it is not hard to show that this bound is sharp, 
meaning that for each weight w convolution with some Pr has the norm at least c[w]y^. 

But for the classical singular integral operators like Hilbert Transform or Riesz Trans- 
forms the linear in [w]^ estimate is the best one can hope to get. The conjecture about 
a linear in [w]^^ estimate of the norm for such and more general operators has become 
known as the A2 conjecture. 



SHARP A2 ESTIMATES OF HAAR SHIFTS VIA BELLMAN FUNCTION 



3 



For the maximal fmiction, the estimate C ■ ['w]j^^ was proved by S. Buckley |5j: he 
also proved that this estimate is optimal for the maximal function. The first result for a 
"singular integral" operator was due to J. Wittwer |27j . who proved the A2 conjecture for 
the Haar multipliers. 

Using this result and Bellman function technique S. Petermichcl and A. Volberg f23j 
proved the A2 conjecture for the Beurling-Ahlfors operator (convolution with 7r~^z^^ in 
C). An alternative proof was given a little later by O. Dragicevic and A. Volberg [?J via the 
representation of the Beurling-Ahlfors Transform as an average of Haar multipliers over 
all dyadic lattices. 

The result for other Calderon-Zygmund operators, in particular for the Hilbert Trans- 
form (the convolution with (vrx)"^ on R) remained open for some time. 

The next breakthrough was made by S. Petermichl [20] who proved the A2 conjecture 
for the Hilbert Transform. She used the representation of the Hilbert Transform as the 
average over all (translated and dilated dyadic lattices) of a simple dyadic transformation 
(the so called Haar shift), and proved the A2 conjecture for this operator. Later ^21j she 
used similar ideas to prove the A2 conjecture for the Riesz Transforms. 

Note that it took a lot of time and effort to go from J. Wittwer's [2^ estimate for the 
simplest dyadic operator to the S. Petermichl's [20j estimate for the Haar shift, despite the 
fact that the Haar shift was just a little bit more complicated than the Haar multiplier. 

Lately, there was a lot of activity, that eventually lead to the complete solution of the 
A2 conjecture for all Calderon-Zygmund operators. Namely, M. Lacey, S. Petermichl and 



M. Reguera [13j proved the A2 conjecture for all so-called Haar shifts, see Definitions 2.1 



2.2 below, generalizations of the operators used in [20j. This result immediately implies A2 



conjecture for all operators that can be represented as averages of Haar shifts of fixed com- 
plexity (see Definition 2.1 below), in particular it gave another proof for Riesz transforms 
in M''. 

The technique used in [13] was a very clever application of the stopping moment reason- 
ings, and was based on the result from |17j about two-weight estimates for the so-called 
well-localized operators. Unfortunately, the estimates of the norm in [13] grew exponen- 
tially in the complexity of the Haar shift, so A2 conjecture for general Calderon-Zygmund 
operators remained open. 

For general Calderon-Zygmund operators the conjecture was finally settled by T. Hyton- 
en |1U| . One of the crucial components in his proof was the representation of an arbitrary 
Calderon-Zygmund operator as a weighted average (over all translated dyadic lattices) of 
the Haar shifts (of all possible complexities), where the weights decrease exponentially in 
the complexitjj^ This would imply the A2 conjecture if one could prove the A2 conjecture 
for the general Haar shifts with the estimates depending sub-exponentially (for example 
polynomially) on the complexity. 



In fact, for general Calderon-Zygmund operators one also should add the averages of so-called para- 
products and their adjoint. But the paraproducts are the operators of fixed complexity, so any known linear 
in [w]^ estimate for such operators would work. We say more about estimates for paraproducts later in 
Sectionls] 
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This program was realized later in [TT], where the polynomial in complexity (and of 
course linear in [^i']^^) estimate of the norm was obtained for general Haar shifts; a simpler 
representation of a Calderon-Zygmund operator as a weighted average of the Haar shifts 
was also presented there. 

The original proof in |10j used a result from a very technical paper |19j . where the A2 
conjecture was reduced to a Sawyer type testing condition and as a corollary to a weak 
type estimate. 

We should also mention as an interesting fact that all the proofs mentioned above used 
the results and / or technique from two papers by Nazarov-Treil-Volberg [T8] about two 
weight estimates for martingale transforms 

In the present paper we present a simple proof of the A2 conjecture for arbitrary Haar 
shifts with linear in complexity estimates. We get this estimate directly from the J. Wittwer 
result |27j {A2 conjecture for the simplest Haar multipliers) using the Bellman function 
technique. Together with the representation of a Calderon-Zygmund operator as a weighted 
average of Haar shifts from [lOj or [llj it gives the A2 conjecture for general Calderon- 
Zygmund operators. 

The proof is really simple and elementary; it is significantly simpler than any previous 
proof for the Haar shifts. It is really a shame that this proof was not discovered earlier. 

The essence of the proof is a simple result about convex functions. Functions of such 
type appear often when one uses Bellman function method in dyadic Harmonic analysis, 
so the main result can be used for the transference, when one extends result obtained for 
the simplest dyadic model to more general martingale ones. 

Finally, we should also mention preprint |23j where the Bellman function method was 
used to prove the estimates for the Haar shifts of complexity and 1 (in particular for the 
S. Petermichl's Haar shifts [20j). 

2. Main objects. 
2.1. Dyadic lattices. The standard dyadic system in is 

^0 := y ^0, ^0 := {2'= ([0,1)^^ + 771) :mGZ^}. 
feez 

For I ^ and a binary sequence uj = {ujj)'jL_^^ G ({0, l}'^)^, let 

j<k 

Following Nazarov, Treil and Volberg |17| Section 9.1], consider general dyadic systems of 
the form 

S)=S)'^ ■= {I+oj : / G = (J 

fcez 

Given a cube I = x + [0, £Y, let 

chld(/) := {x + ?7^/2 + [0, £/2)'^ : ?? G {0, 1}'^} 
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denote the collection of dyadic children of I. Thus = |J{chld(/) : / G ^^}. Note 

that, in line with [171 E] but contrary to [lO], we use the "geometric" indexing of cubes, 
where larger k refers to larger cubes, rather than the "probabilistic" indexing, where larger 
k would refer to finer sigma- algebras. 

2.2. Martingale difTerence decompositions and Haar functions. For a cube / in 

let 



JechidC/) 

It is well known that for an arbitrary dyadic lattice ^ every function / G L^(M'^) admits 
the orthogonal decomposition 

Given a cube Q in W^, any function in the martingale difference space ^qL"^ is called 
a Haar function (corresponding to Q) and is usually denoted by h^. Note, that here Hq 
denotes a generic Haar function, not any particular one. 

In other words, a Haar function is supported on Q, constant on the children of Q 
and orthogonal to constants. 

2.3. Dyadic shifts. 

Definition 2.1. Let m,n G N. According to [11] an elementary dyadic shift with param- 
eters m, n is an operator given by 

S/:=E E \Q\-\f,h%)h%, 

QeS* Q',Q"(^S>,Q',Q"CQ, 

e(Q')=2~"^e{Q),e(Q")=2-"e{Q) 

where /ig, and /ig,, are (non-weighted) Haar functions for the cubes Q' and Q" respectively, 
subject to normalization 

(2 ^) \\h'^"\\ ■ Wh'^' II < 1 

X'^'-^J II lloo Ir^Q/'lloo _ 

Notice that this implies, in particular, that 

(2.2) §f{x)=^\Q\-^ a^{x,y)f{y)dy, supp Cg C Q x Q, ||ag ||oo < 1 , 
where 

(2.3) ag(x,y)= E h%,{x)h%{y). 

Q',Q"(i&,Q',Q"cQ, 
e{Q')=2-'^e{Q),£{Q")=2-"e{Q) 

We will call the number max(m, n) -|- 1 the complexity of the dyadic shift. Note that in 
|10^ [TT] the complexity was defined as max(m, n); we use max(m, n) -|- 1, because it will be 
more convenient for our purposes. 
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We will use a little more general definition of a Haar shift. 
Definition 2.2. A Haar shift S of complexity n is is given by 

where the operators S„ act on A" and can be represented as integral operators with 
V Q 

kernels a„, ||a„||oo < The latter means that for all f,g £ A"L^ 

{^nf'9)= / anix,y)f{y)g[x)dxdy. 



We can always think that our dyadic shifts S are finite dyadic shifts meaning that only 
finitely many Q's are involved in its definition above. All estimates will be independent of 
this finite number. 



3. Bellman function for sharp weighted estimates of the dyadic 

martingale multipliers 

The simplest example of a Haar shift is the so-called dyadic martingale multiplier. 
Namely, let ^ be the standard dyadic lattice in M. For an interval I £ & let be 
the standard L^-normalized Haar function, hj := — Ij^)) where Ii and I2 are 

the left and the right halves of / respectively. Given a numerical sequence a = 
\aj\ < 1, define the operator by 

The family of operators T„ can be considered to be the simplest dyadic analog of singular 
integral operators, so the sharp weighted estimates for such operators were the natural 
thing to try before attacking the case of general Calderon-Zygmund operators. 

It was shown by J. Wittwer [27j that if a weight w satisfies the dyadic Muckenhoupt 
condition 

sup{w)j{w'^)j =: [w]'\^ < 00 

the operators are (uniformly in o", \aj\ < 1) bounded in L'^{w) by Ci [w]'^^, where Ci is 
an absolute constant. 

Denoting by Ii and I2 he children of I one can rewrite the estimate of the norm as as 

(3.1) - • - • 1^1 ^ c-Ml 

for all / e L'^{w), g G L^{w-^)] here C = Ci/4. 
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3.1. Bellman function for the martingale multipliers and its properties. Follow- 
ing the standard Bellman function technique, cf. |14tll5j. let us define the Bellman function 
for the problem. Let ^ > 1, and let Ii and I2 denote the children of an interval /. Fix a 
dyadic interval Iq, for example /q = [0, 1] and for real numbers f , g, F, G, u, v satisfying 

(3.2) u,v>0, 1 < uv < ^, f^ < Fv, g2 < Gu 

define the function B = Bj^ by 



{9)1, - {9) 



^^(f,g,F,G,u,v):=|/orisup | 

K^Si-.IClo 

where the supremum is taken over all (real-valued) functions /, g and the (dyadic) Muck- 
enhoupt weights w > on Iq, such that 



(3.3) 



sup {w)Aw ), < A, 

I&&,IClo 



(3.4) = f , {rw)j^^ = F, 

(3.5) H7„=u, W)/o=^- 

Here again Ii I2 are the children of the interval I. 



{9) 



lo 



{9'w-': 



lo 



3.1.1. Properties of B^. The Bellman function B^ satisfies the following properties: 

(i) Domain Dom(i3^) of Ba is given by (3.2); this means that for every set of numbers 



f, g, F, G, u, V satisfying (3.2) there are functions /, g and a weight w satisfying 



(3.4), (3.5), so the supremum is well defined (not equal —00). It also mean, that 



if the variables are the corresponding averages, they must satisfy the constrains 



(3.2). 



(ii) Range: < i3^(f, g, F, G, u, v) < CAF^/^G^/^; 

(iii) The main inequality: for any three 6-tuples (let us call them X = (f , g, F, G, u, v), 
Xi = (fi,gi,Fi,Gi,ui,vi), X2 = (f2,g2,F2,G2,U2, V2)) in the domain satisfying 
X = {Xi+ X2)/2 

B^{X) > (B^iXi) + B^{X2))/2 + |fi - f2| • |gi - g2| 

Note that because of correct homogeneity, the function Ba does not depend on the choice 
of the interval Iq. 

Let us explain the properties of the Bellman function B^. The property (i) is easy to 
explain. Namely, for any weight w and any interval I 

{w)j{w-')j>l. 

On the other hand the dyadic Muckenhoupt condition A2 means that {w)j{w-^)j < A 
for any I G so the inequalities 1 < uv < A must be satisfied. On the other had, it 
is an easy exercise to show that for any uv satisfying 1 < uv < A one can find a dyadic 



Muckenhoupt weight w satisfying (3.3) and (3.5); one just can consider functions constant 
on the children of Iq. 
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The inequaUty < Fv is just the Cauchy-Schwartz inequaUty: 



fdx 



< 



f'^wdxj 



1/2 



W 



1/2 



On the other hand, given a weight w it is not hard to find a function / satisfying (3.4): we 
just put / = vT^/^i + (/), where J^^ cj)wdx = and /^-^ (jP"w = F — f^. 

Property (ii) is pretty straightforward: B^{X) > be the definition, and the inequahty 
13 ^{X) < CAF^/^G^/^ follows from ( |3l| ). 

Let us now explain the main inequality (iii). Let X,Xi,X2 € Dom(;B^), X = {Xi + 
X2)/2, and let if and be the children of the interval Iq. Consider functions / and g and 
a weight w on Iq such that 



(3.6) 



X 



1,2 



^1.2 ^1.2 ^1.2 ^1.2 ^1,2 'l,: 



(we just construct the functions on the intervals and I2 with the prescribed averages 
there: as we just discussed above such functions always exist). Then 

is the vector of corresponding averages over Iq. Denoting by Ji and I2 the children of an 
interval / we can write 



1-1 



E \if)i.-if)i. 

I&Si: Ido 



I\ 



|fi 



|gl - g2| + 1^0 



{9)1, - {9) 



Taking the supremum over all /, g and w satisfying (3.6) we get in the right side 

|fi - h\ ■ |gi - g2| + {B^{Xi) + B^{X2)) /2. 

The supremum in the left side is clearly bounded above by Bj^{X)^ which proves the main 
inequality (iii). □ 

Remark 3.1. Notice that the domain of is not convex, the reason being that the set 
{u,v > : uv < A} is not convex. 

However, all the other constrains define the convex sets, so the non-convexity of the 
domain is completely given by the behavior of the coordinates u, v. Namely, if the points 
X+, X_ e are in the domain, and for each X = Xq = 6X+ + (1 - 9)X-, < < 1, we 
know that uv < A, then all Xg, < < 1, are in the domain. 

Remark 3.2. The main inequality (iii) implies that the functions B^ are concave, namely 
that if the points Xi, X2 and the whole interval [Xi,X2] = {(1 - 9)Xi + 6X2 : 9 G [0, 1]} 
are in Dom;B^, then for all 9 £ [0, 1] 

(3.7) B^{{1 - 9)Xi + 9X2) > (1 - 9)B^{Xi) + 9B^{X2). 
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Condition (iii) implies the so called midpoint concavity, i.e. (3.7) with 9 = 1/2. But that 
is a well-known fact in convex analysis that for locally bounded concave (convex) functions 
midpoint concavity (convexity) is equivalent to the regular concavity (convexity), i.e. to 
the inequality for all 9 G [0, 1]. 

The classical reference here would be the monograph 0, see Statement 111 in Section 
3.18 where it was stated for convex functions. Note, that only upper bound was assumed 
in [9j , so for the equivalence of midpoint concavity and concavity one can only assume that 
a function is locally bounded below. 

Recall also, that any bounded above convex (respectively bounded below concave) func- 
tion is continuous, and even locally Lipschitz, see jH Theorem 4.1.1]. 



4. The main result: sharp weighted estimates of the Haar shifts 
The theorem below is formally the main result of the paper. 

Let a dyadic lattice ^ be fixed. Recall that a weight w satisfies the dyadic Muckenhoupt 
condition A2 (with respect to the dyadic lattice ^) if 

sup{w)j{ur^)j =: < 00 



Theorem 4.1 (Main result). Let E> be a Haar shift in M*^ (in the sense of Definition 2.2) 
of complexity n, and let a weight w satisfies the dyadic Muckenhoupt condition Ag. Then 



the norm o/S in L?'{w) is at most 32An2'^'^ "^C [w]'^^, where C is the constant from (3.1). 

5. Preliminaries for the proof 
5.1. Some simple reductions. Let us simplify the problem a bit. 

5.1.1. First slicing. Let us recall that the Haar shift S was represented as 

= X] 1^1 W '^Q(^'y)fiy)dy , suppa^ C Q x Q, lla^Hoo < l, 



where are given by (2.3) 



Let n be the complexity of the shift S. We can decompose S as the sum S = Ylk=o ^k, 
where for < A; < n the operator = is defined by taking the sum only over the cubes 
Q of length 2^=+"^ j G Z. If we denote £k = CI := {Q e & : i{Q) = 2^=+"^ j G Z}, then 
we can write 



Sfc/= E 1^1"' / aQ{x,y)fiy)dy 



The advantage of the operators Sk is that they are martingale transforms when we are 
moving n units of time at once, so it is possible to apply Bellman function method. 
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1 


to 


3 


4 



Figure 1. Putting cubes on the line 



For the readers who are not comfortable thinking in terms of martingales we will just 
write the estimate 

(5.1) \{^kf,9)\< IQr'l|AJ/||i||AJg||i 

Q&Ck 

and notice that the functions A" f and A"' q are constant on all cubes R £ Ck, Q. 

Q-' Kj ^ ^ 

5.1.2. Reduction to the real line. First of all it is sufhcient to deal only with the Haar shifts 
on a dyadic lattice in M (we can assume that we are dealing with the standard dyadic lattice, 
but the proof for a general dyadic lattice is the same). 

So for the slice Sk of the Haar shift in M*^ we make its representation on the real line by 
"arranging" cubes along the real line. Namely, for a dyadic cube Q in R"^ take a dyadic 
interval /, |/| = \Q\; this interval will correspond to the cube Q. Dividing one side of 
Q into 2, split Q into 2 equal parallelepipeds: then pick a (one to one) correspondence 
between the parallelepipeds and the children of / (the choice of the correspondence does 
not matter). 

Then by dividing a longer side split each parallelepiped into two equal ones, and and 
make a correspondence between the new parallelepipeds and the children of the corre- 
sponding intervals, see Fig. [T| 

After d divisions we end up with the correspondence between the children of Q and the 
intervals J G chldd(/). Note that the intervals J £ chldfc(/), 1 < k < d correspond to some 
"almost children" R oi Q. Here by an "almost child" we mean a parallelepiped some with 
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some of the sides coinciding with the sides of Q and the other sides being halves of the 
corresponding sides of Q. The construction for c? = 1 is presented on Fig. [T| 

One can run this construction up, i.e. for / being the parent of / and for Q being 
the grandparent of Q of order d we can construct, using the procedure described above, 
a bijection $ from the children and almost children of Q to the intervals J G chldfc(/), 
\ < k < such that ^{Q) = I- To assure that $(Q) = / one just need at every 
division make the image of the almost child containing Q to be the dyadic interval (of the 
appropriate length) containing /. 

A function / € L\^^{W^) will be transferred to a function g € ^[^(.(M) such that = 
((7),, for aUQ,/, / = ci>(Q). 

Let us see what is the "price" to pay for this reduction. First, if S is a Haar shift (or its 
slice) of a complexity n in W^, then its model in M will be a shift of complexity nd. 

The (dyadic) A2 norm of the weight on M is svcp ^{w) ^{w) ^, where the supremum is 
taken over all dyadic cubes and over all their almost children. If R is an almost child of Q, 
then 



A? 



w{x)dx < / w{x)dx, / w{x) dx < w{x) dx, 
R Jq Jb^ Jq 

and \R\ > 2~'^+^|(5|. So after the transfer to the real line, the Muckenhoupt norm [w 
of the weight increases at most 2^^'^"^^ times. 
5.2. A technical lemma. 

Lemma 5.1. Let X,X+,X^ G Dom^^, X = (X+ + X_)/2. Then the interval 

belongs to Dom;B^/, where A' = 9A/8, i.e. for all 6 G (0, 1) we haveXg = 9X++{1-9)X^ G 

Dom^ . 



Proof. As it was discussed above in Remark 3.1, we only need to check the non-convex 
constrain uv < A. Because of scaling, it is sufficient to show that if end points and a 
center of an interval are in the set {{x,y) G '■ x,y > 0, xy < 1}, then for any point on 
the interval xy < 9/8. 

It is no hard to see that the worst case scenario is when center and one endpoint is on 
the line y = 1/x (and the other endpoint is on a coordinate axis). Since the rescaling 
x I—)- ax, y I—)- a^^y, a > does not change the domains xy < C, we can assume without 
loss of generality that the center of the interval is at the point (1,1). Then the picture for 
worst case scenario will be as on Fig. [2| or its reflection in the line y = x. 

It is an easy exercise to find that the maximal value of xy on this interval is 9/8. □ 

6. The main estimate 

The following lemma is in the heart of the matter. In fact, it can be considered as the 
main result of the paper. 

Lemma 6.1 (The main estimate). Let Ba, A > 1 be a family of functions satisfying 



conditions (i), (ii), (Hi) from Section 3.1. Let Lq be a dyadic interval, and let for all 
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Figure 2. Proof of Lemma IbTU 

I £ chldfe(Io), < A; < ?i, the points Xj = (f^, g^, F^, G^, u^, v^) G Domi3^ be given. 
Assume that Xj satisfy the dyadic martingale dynamics, namely that if Ii, I2 are the 
children of I, then 



Then for A' = 4. 5 A we have 

E If/- 

/echid„(/o) 




\ /echid„(/o) 

6.1. Plan of the proof. Let us first explain the idea of tlie proof informally. We will use 
the language of random processes (martingales) as the most convenient for the informal 
explanation, but the formal proof will be completely elementary. 

So, we a given a dyadic martingale Xj I € where at each point we have 2 choices 
with equal probability, and we want to estimate below the difference between B^, (Xj) at 
the initial moment I = Iq and its expected value after n steps. 

An immediate idea would be to use the property (iii) of the Bellman function to estimate 
expected loss at each step. But this would not work: the conditional expectation of the 
loss when going from the state Xj to the states X^^ , Xj^ , where Ii^2 are the children of / 
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is estimated below by c\{j^ ~ ^/l " IS/^ ~ S7I1 this quantity can be small, it even can be 
zero: imagine a situation when at each step only one of the variables f or g is changed. 

So, the main idea is to change the martingale, preserving the starting point X^^ and the 
final values and probabilities. 

Namely, from the starting point X^^ we move with probabilities 1/2 to the points X"*", 
and X-, X± = X± = (f±,g±,F±,G±,u±,v±), Xj^ = {X+ + X-)/2. We wih move 

"sufficiently far" , so |f^ — f^-^ | and |g^ — g^^ | are comparable to 2^" X^/gchid„(7'o) 1^/ ~ ^lo I 
and 2~" ^jg^jjj^^^j^-j |g^ — g^^| respectively, but "not too far", so the points X^ will be 
"almost averages" of the points Xj, I G chld„(/o). 

Since we moved sufficiently far, we get the amount we need at the first step. Moreover, 
since the points X^ are "almost averages" of the points Xj, I G chld„(Io), we can start 
the process from the points X^ to end up after n steps at the points Xj, I G chld„(/o). 

Of course, it will not be possible to get from X~^ (or X~) to the end points X^, I G 
chld„(/o) via the standard dyadic martingale, when at each point we have 2 choices with 
equal probability. But since the first step was sufficiently small, it will still be possible to 
get to the desired endpoints via a binary martingale, where at each point we have 2 choices 
with not equal but with almost equal probability. 

It might be intuitively clear from the symmetry of X^, that after moving from Xj to 
it is possible to start the martingale from X^ so we get from Xj^^ to the endpoints 
with equal probability. More precisely, we will get from X'^ to X^, I G chld„,(/o) with 
probability 2~"(1 + a^), {a^l < 1/3, and we get from /~ to the same point / with the 
probability 2~"(1 — Oj), so the probability of getting from Iq to I will be 2~". 

If this reasoning is not intuitively clear to the reader, he or she should not worry, because 
the rigorous proof (not requiring any probability) will be presented later; the probabilistic 
interpretation will guide us through the formal calculations. 

6.2. Formal proof: another technical lemma. Let us recall a theorem by K. Ball [21 
Theorem 7], solving the famous Tarski plank problem for convex bodies. This theorem also 
can be treated "multiple Hahn-Banach Theorem". 

Theorem 6.2. Let xt he unit vectors (\\xk\\ = 1) in a (real) normed space X, and let 
rrik G M, Wfc > such that 

k 

Then there exist a functional x* G X* , \\x*\\ < 1 such that 

\{xk, X*) — rrikl > Wk for all k. 
Note that the number of vectors here can be infinite. 

This theorem with 2 vectors, m/- = 0, wi = W2 = 1/2 gives as the following simple 
lemma that we will use. 
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Lemma 6.3. Let X be a (real) normed space, and let a,b G X, \\a\ 
exists X* S X* , \\x*\\ = 1 such that 



1 . There 



\{a,x*)\>l/2, \{b,x*)\>l/2. 

Note, that for our purposes we just need this lemma with some constant, not necessarily 
with the optimal one 1/2; we just get a worse constant in Theorem 4.1 A proof of this 
lemma with some constant is an easy exercise in elementary functional analysis. 



6.3. Formal proof: the first step. Let N = 2", and let £^ be the space 



pN 



endowed 



with the norm. Define e G i^, e = (1,1,..., 1). Consider the quotient space X 
span{e}. For x e i]^ let x^ := x — N^'^{x, e)e. Then 



(6.1 



X ^ < \\x 

I II ^ II 



< 2||x|| . 



Indeed, the first inequality is trivial (follows from the definition of the norm in the quotient 
space). As for the second one, |(2;,e)| < = N, so it follows from the triangle 

inequality that 

llx'^ILi < llxlLi + A^~"^|(, X, e)| llelLi < 2||x|Li . 

This inequality remains true if one replaces x by x — ae, a G M, so the second inequality 
in (6.1 ) is proved. 

The dual space X* can be identified with s subspace of 
that (e,x*) = (with the usual £^-norm). 

Applying Lemma |6.3| to the above space X with a and b being the normalized vectors 
jfr — f, |, ^ and |e, — Kr , , , /r X respectively, we get that there exists x* = 

I Jo /echld„(/o) '-°J °Io' lechldn(Io) ^ J) o 

{a^}/echid(7o) such that 



^ consisting of x* G l'^ such 



E 



a. 



0, 



/echid„(7o) 



lo' 



(6.2) 

(6.3) 
and 

(6.4) 
(6.5) 

Note that Lemma 

constant 1/6, and the second inequality in (6.1) give as an extra factor 1/2. 

Below, all identities (inequalities) with it mean that there are two sets of identities 
(inequalities): one with + and the other with — . 



/echid„(/o) 



V/ G chld„(/o 



-12 ^ ' ^ 

/echid„(/o) 



E «/(g/-g 



7echld„(7o) 



> 



1 

12 



E 



/echid„{/o) 



6.3 



gives the comparison with the norm in X = £]^/ spane with the 



SHARP A2 ESTIMATES OF HAAR SHIFTS VIA BELLMAN FUNCTION 



15 



Define = Xf = (f±, g±, F±, G±, u±, v±) by 



lo 

76chld„{/o) /echld„(/o) 



soXj^ = {X+ + X-)/2. 



Note that (6.4), (6.5) and the identities 



/echld„(/o) 



8/0 = 2- 



/echid„(7Q) 



imply that 



if^ -f, I > —2-" y If, -f, I, 
19 ^ 'I ^ " 



/o I - 12 



/echld„(/o) 



/o I - 12 



/echid„(/o) 



Let 



:= 1 ± a^; 



note that 2/3<Xj< 4/3. 

6.4. Formal proof: modifying the martingale. For / G chldfc(/o), 1 < k < n define 



.6) 



\ 



Jechld„(7o): 

Jci 



( 



J 



\ 



X 



. Jechld„(7o): I 



Note, that for I G chldn(/o) we have X^ = X^ = X^, where Xj are the points given in 
the statement of the lemma. 
For / G chld(ir) define 



(6.7) 



\ 



E 



X . 



. Jechld„(/o): I 
\ JCI I 



\ 



E 



. Jechld„(/o): I 
\ JCK I 



Note that (9^ > 0, and it Ii, h are the children of /, then 
(6.8) 



i.e. X^ (respectively X^ ) is in the interval connecting X^ and (respectively and 
X~\. From the probabilistic point of view, and are the probabilities of moving from 



X^ to X^ and Xj^ respectively. 



h 

It is not hard to show that 9^ cannot be too close to or 1, but we do not need this 

-11,2 

fact for the formal proof. 
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Identity ( |6.6| ) means that the points are in the convex huh of the points Xj : 
J £ chld„(/o), J C /. We claim that moreover, Xf G Dom(^^^) for all / G chldfc(/o), 
< A; < n. 

Indeed, since X^ are in the convex hull of the points Xj G Dom(S^) C DomB^^, and 
among the constrains defining Dom(;B^^) only the constrain 

uv < AA 

is not convex, we only need to check this constrain. 

Notice that equation (6.6) with all replaced by 1 gives us Xj, which belongs to 
Dom(i3^). Let us look at the u-coordinate of X^: we can easily see that 



4 ,2 



J - 73/3 /> 



because the maximal possible numerator in the u-coordinate of (6.6) happens when all 
=4/3 and the minimal possible numerator when all x-^ = 2/3. The same holds for v^, 
so u^v^ < 4uj-v^ < AA, thus Xf £ Dom(i3^^). 

We can also show that if Ii and I2 are the children of /, then the centers of the intervals 
[X^ ,X^] also belong to B^j^. Indeed, we know that Xj = {Xj^ + Xj^)/2 (the center of 

[Xj^,Xj^]) belongs to B^. As we discussed above, 

ut < 2u, , vt < 2v, . 

h,2 ^1,2 h,2 ^1,2 

Therefore, if and are the u and v coordinates of the centers of the intervals \X^ , X^], 

h h 

we can conclude that 

< 2\ij, < 2v^ 

thus the centers of the intervals [Xf,Xf] also belong to B.. (as we discussed above, we 
only need to check the non-convex constrain uv < AA). 

Therefore, by Lemma 5.1 the intervals [Xf , Xf] are in ;B , where A' = 4^(9/8) = 4.5A. 
I I h I2 ^ 

6.5. Conclusion of the proof. Now we are ready to complete the proof. By property 
(iii) of the Bellman function, using the fact that |f+ - f"| = 2|f=^ - f | 



It follows from the concavity of B^, and (6.8) that if Ii and I2 are the children of /, then 

B^,{x^)>elB^,{xl) + elB^,{x%); 

recall that as we discussed above, the whole intervals are in Dom(S . ). 
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Let US apply this inequality to Iq, then substitute in the right side the inequalities for 
the children of Iq, and so on. Then, using the fact that for I £ chld„(/o) 

J&Sl:IC.J<^Io \Jechld„(/o) / 



we get the inequality 



/echid„{/o) 



note that we are using Xj instead of X^ in the right side, because X^ = X^ = Xj for 
I G chld„(/o). 



Substituting this inequality into (6.9) and taking into account that + =2 for 
I G chld„(/o), we get 



(6.10) 



|f^-f,J-|g^-gJ<|k,(Xj-2-'^ Y: B^,iX^)] 

\ /echid„(/o) / 



The estimates (6.4) and m.5h mean that 



(6.11) 2- |f/-f7ol^l2|f±-f,J, 

/Gchld„(7o) 

(6.12) 2-" Y |g,-g,J <12|g^-g,J 

/Gchld„(/o) 



Combining these estimate with (6.10) we get the conclusion of the lemma. □ 



7. Proof of the main result (Theorem 4.1) 



As we discussed above, we need to estimate the "slices" of the Haar shift, or equiva- 
lently, their models on the real line M. Let § = be such a model of a slice, and let n be 
its complexity. As it was shown before 

\{^kf,9)\< Yl |/r'iiA;/iiiiiA^5iii 



For / G ^ let X^ := ((/)^, (5),, {gw~^)j, H,, {w-^)j). 

Define A := [w]'^^, so X^ G Dom(^^) for all / G ^. Lemma 



6.1 



states that 



Jechid„(/) 
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Writing this estimate for each J G chldn(/), then repeating this m times we get 

J2 |J|-^||A;;/||i||A;;g||i<72[|/|s^,(x,)- \J\^a'(^j 

J&Ck-.JCl \ jGchld™„(/) 

e{j)>2-"'^e{i) 

<i.5- 72 ■CA{fw)y^{g^w-y/^\I\ 

= 324C7^||/1^ 11^2(^)1191^ 11^2(^-1) 

where C is the constant from the property (ii) of the Bellman functions B^; here the second 
inequality holds because Ba'{Xj) < d'Fj/^Gj/s = A.^CAFy^Gy^ and 13^,{Xj) > by 
property (ii) of the Bellman function. 
Letting m — t- oo we get 

|Jr'||A;;/||i||A;;5||i < 324c^||/i^||^2(^)||5i,iU2(^-i) 

JeCk-.Jci 

Covering the line by the intervals / G £ of length 2*^ and applying the above inequality to 
each / we get 

Y |JriA;;/||i||A;;5||i < 324^^11/11^2(^)11511^2(^-1), 

JeCk:\J\<2M 

and letting M — t- oo we get that the norm of each slice is bounded by 324CA. Recall, 
that we had n slices. Recall that A is the A2 norm of the weight w transferred to the real 
line M. As it was discussed above in Section 5.1.21 it is estimated by 2^'^~^[w]^^, where 



[w]^ is the A2 norm of the original weight w in'i 

Gathering everything together we get the conclusion of the main result (Theorem 4.1). 

8. Estimates of the paraproducts 

As it was mentioned before in the Introduction, to prove the A2 conjecture for gen- 
eral C alder 6n-Zygmund operators, besides getting linear in [w]'^^ and subexponential in 

complexity estimate for the Haar shifts, one also need a linear in [w]'^^ estimate of the 
so-called paraproducts. Since the paraproducts have a fixed complexity 1, one does not 
care about growth of the estimates with complexity, and any (linear in [^^l^^) estimate of 
the paraproduct, for example one obtained in [13] would work. 

Here we would like to show how using Bellman function approach to get linear in [w]^^ 
estimate of a general paraproduct from the estimate for the simplest paraproduct in M 
obtained by O. Beznosova [3j. 

Let us recall the main definitions. Let a dyadic lattice ^ in M*^ be fixed. 

Definition 8.1. Let be a locally integrable function. A paraproduct H^p with symbol ip 
is defined by 
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It is well known that the paraproduct li^p is bounded in (unweighted) if and only if 
e BMO'^, i.e. if and only if 



1 V ^ 1 1 . 1 1 2 II 1 1 2 
17 II yriiz ll'^llRMDd 



This statement is in fact equivalent to the dyadic Carleson Embedding Theorem, and from 
the sharp estimates in the Embedding Theorem one can get that norm of H^p is bounded 
by 2||w|| . . 

As for the linear in [w]'^^ weighted estimates of the paraproducts, it was proved by 
O. Beznosova in [3j that for the paraproduct on the real line 

where Ci is an absolute constant. 
This result can be rewritten as 

(8.1) K/),IIIA,^lbK.),, - K.)J • 1/1^/^ < c[< ll9^ll3,oJI/ll,.(^)lkl,,.(^_.) 

for all / G L'^{w) and all g G L?'{w^^ (because changing the signs in the decomposition 
if = Yliii^si does not change llv^llgp^Qd )• Here || • II2 is the unweighted norm, Ii and 
I2 denote the children of / and C = Ci/2. b 

We want to extend this result to W^. We will show how to prove the following proposition 

Proposition 8.2. Let (p G BMO^{W^) , and let 11^ he the corresponding paraproduct in 
L2{W^). Let a weigh w in satisfies the dyadic Muckenhoupt condition Then 11^ is 
bounded in L'^(w) with the norm at most 1622'^^~'^C\\(p\\ . \w]'\ , where C is the constant 
from (8.1 ). 

We present the proof of this result using the Bellman function from the result for the 
real line. Note that because of homogeneity it is sufficient to prove Proposition 8.2 only 
for ||(/?|| J < 1. 

8.1. Bellman function for paraproducts on the real line. For an interval /q G let 
us fix the quantities 

(8.2) (/)/o=f' (/'^)/o=^' ^5),„=g, {g^w~^)j^=G 

(8.3) M/o=u, (^"')/o=^- 

(8.4) M = \h\-^ \\^M\l 

and define the function 13^{{, g, F, G, u, v, M) as 

^^(f,g,F,G,u,v,M) = l/ol-'sup K/)/l • \\^M\2\{9)j^ - \{9)jJ ■ 

I&9:IClo 
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where the supremum is taken over all / G L'^{w), g £ L'^{w~^, all weights w, ['w]'\^ < A 
and all ip G BMO'^, llv'llgj^Qd < 1^ satisfying (fs^-dsl). 



8.1.1. Properties of Bellman function. 

(i) Domain Dom(;B^) is given by the conditions 

u, V > 0, 1 < uv < yl, f 2 < Fv, g2 < Gu, < M < 1; 

this simply means that for any choice of the appropriate functions the correspond- 
ing averages satisfy these constrains and that for any 7-tuple of reals satisfying 
these constrains there are functions with corresponding averages over Iq, so the 
supremum is well defined (not — oo). 

(ii) Range: 

< B{X) < CAY^'^G^/^. 

(iii) The main inequality: Let X = (f , g, F, G, u, v). Then for all 

(X,M),(Xi/2, Mi,2) e Dom(^^), 

such that X = {Xi+ X2) /2, M - (Mi + M2) /2 =: d > 0, the following inequality 
holds: 

B^{X,M) - {B^{Xi,Mi) + BjX2,M2)) /2 > d|/| • |gi - gi|. 
The proof of the properties of the Bellman function is pretty standard, one can do it 



following the lines of Section 3.1.1 



8.2. The main estimate. The proof of Proposition 8.2 follows easily from the lemma 
below. The details of the reduction are essentially the same as for the Haar shifts (slicing, 
remodeling on the real line and then estimating each slice) , so we leave it as an easy exercise 
for the reader. 

Lemma 8.3 (The main estimate). Let Ba, A > 1 he a family of functions satisfying 



conditions (i), (ii), (iii) from Section 8.1.1. Let Iq be a dyadic interval, and let for all 



I £ chldfc(/o); < k < n, the points X^ = (f^, g^, F^, G^^, u^, v^), M^, {Xj G Dom;B^ be 
given. Assume that Xj satisfy the dyadic martingale dynamics, namely that if Ii, I2 are 
the children of I, then 

Assume also that Mj = {M^^ + Mj^)/2 for all I except Iq, and let 

Mj, - 2- E =: d,^ > 0. 

/echid„(/o) 



Then for A' 
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\ /echld„(/o) 

\ /echid„(/o) 

To prove this lemma one can just literally follow the proof of Lemma |6.1[ The main 
difference (a simplification) is that when we are picking weights a^, < 1/3 for the first 



step, we only need to get the estimate (6.5), which can be done trivially (because we only 
need to control one parameter) and with the constant 1/6 instead of 1/12. This give the 
constant 36 instead of 72 in the conclusion of the lemma. 
Then we define 

(X±,M±) = (X±,M±) = 2- Yl i^r^i)' 

/echid„(7o) 



and the rest of the proof is the same as for Lemma 6.1 with replaced by (X^M^). 

□ 

9. Concluding remarks 



The heart of this paper is definitely Lemma 6.1 , which is a very general fact about convex 



function. Concave functions with concavity estimated by |Af| • |Ag| are very common in 
the dyadic Harmonic analysis: they appear if one writes the estimates of the bilinear form 
of dyadic martingale multipliers T^-. 



Lemma 6.1 gives a simple way to transfer the estimates from the dyadic martingale 
multipliers to more general martingale transforms, like dyadic shifts, etc. As the story of 
the A2 conjecture illustrates, such transference until now was considered highly non-trivial, 
even in the case of simplest possible non-multiplier martingale transforms (like the simplest 
Haar shift considered by S. Petermichl). 

It also probably worth mentioning that for the second degree polynomials an analogue 



of Lemma 6.1 was known for some time. Namely, it is a known fact (cf. [6j) that if 



Q[. . . ,x,y, . . .] is a quadratic form such that 

Q[...,x,y,...] > 2\xy\, 

then there exists a > such that 

Q[. . . ,x,y, . . .] > ax^ + oT^y'^. 

Then for 9k >0 



Y OkQl ..,Xk,yk,..-]>aY Okxl + a-^ Yl ^^vl > 2 ^^^'1 f E 

k k k \ k J \ k 



1/2 / \ 1/2 
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Therefore, if $ is a second degree polynomial of variables X = {. . . ,x,y, . . .) such that for 
X = {Xi + X2)/2 

$(X) - ($(Xi) + ^>(X2))/2 > |xi - X2\ ■ \yi - 2/21 
then for any 9^ > 0, Ok = ^ and for any X, Xk such that X = J2k ^kXk, 

\ 1/2 / \ 1/2 



A: V fc 



^ k / 



If we take 2" ter ms an d put 9k = 2~" we get (for the second degree polynomials) a stronger 
version of Lemma 6.1 , where we estimate above not the product of norms, but the bigger 
product of norms. 
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